N86-30359 

STUDIES AND EXPERIMENTS IN THE 
SOFTWARE ENGINEERING LAB (SEL)* 


BY 

FRANK E. McGARRY 
NASA/GSFC 
AND 

DAVID N. CARD 

COMPUTER SCIENCES CORPORATION (CSC) 


ABSTRACT 

The Software Engineering Laboratory (SEL) is an organization 
created nearly 10 years ago for the purpose of identifying, 
measuring, and applying quality software engineering techniques 
in a production environment (Reference 1). The members of the 
SEL include NASA/GSFC (the sponsor and organizer), the University of 
Maryland, and Computer Sciences Corporation. Since its inception, 
the SEL has conducted numerous experiments and has evaluated a 
wide range of software technologies. This paper describes 
several of the more recent experiments as well as some of the 
general conclusions that the SEL has reached. 

1.0 Background (Chart 1) 

Over the past 9 years, the SEL has conducted studies in 4 major 
areas of software technology: 

1. Software Tools and Environments 

2. Development Methods 

3. Measures and Profiles 

4. Software Models 

Most of these studies have been conducted by applying specific 
approaches, tools, or models to production software problems within 
the flight dynamics environment at Goddard. By extracting 
detailed information pertaining to the problem, environment, 
process, and product, the SEL has been able to gain some insight 
into the relative impact that the various technologies may have 
on the quality of the software being developed. 

More detailed descriptions of the overall measurement process as 
well as the SEL studies may be found in References 1, 2, and 3. 

This brief paper describes some of the more recent specific 
experiments that have been conducted in the SEL and the 
types of insight they may provide in the areas of: 

1. Tools and Environments 

2. Software Testing 

3. Design Measures 

4. General Trends 

*The work described in this paper has been extracted from reports and studies carried 
out by members of the SEL. 


F. McGarry 
NASA/GSFC 
1 of 37 



TYPE OF SOFTWARE:  SCIENTIFIC, GROUND-BASED, INTERACTIVE GRAPHIC, 
                   MODERATE RELIABILITY AND RESPONSE REQUIREMENTS 

LANGUAGES:         85% FORTRAN, 15% ASSEMBLER MACROS 

COMPUTERS:         IBM MAINFRAMES, BATCH WITH TSO 


PROJECT CHARACTERISTICS           AVERAGE    HIGH     LOW 

DURATION (MONTHS)                      16      21      13 

EFFORT (STAFF-YEARS)                    8      24       2 

SIZE (1000 LOC) 
  DEVELOPED                            57     142      22 
  DELIVERED                            62     159      33 

STAFF (FULL-TIME EQUIVALENT) 
  AVERAGE                               5      11       2 
  PEAK                                 10      24       4 

INDIVIDUALS                            14      29       7 

APPLICATION EXPERIENCE (YEARS) 
  MANAGERS                              6       7       5 
  TECHNICAL STAFF                       4       5       3 

OVERALL EXPERIENCE (YEARS) 
  MANAGERS                             10      14       8 
  TECHNICAL STAFF                       9      11       7 


FIGURE 1. FLIGHT DYNAMICS SOFTWARE 





The Flight Dynamics environment typically is a FORTRAN environment 
building software systems ranging in size from 10,000 to 
150,000 lines of code (see Figure 1). 

2.0 Software Tools/Environments* (Chart 2 and Reference 4) 

One of the more interesting studies conducted within the 
past several years attempted to 
measure the impact of several development approaches (related to 
environment support) on the quality of software within the flight 
dynamics discipline. 

The three points of study include: 

1. Software Tools 

2. Computer Support 

3. Number of Terminals/Programmer 

The quality of the product was measured using 4 attributes: 

1. Productivity - Number of developed lines of code per man 
month. 

2. Reliability - Number of errors reported per 1,000 lines 
of code. 

3. Effort to Change - Average number of man hours 
required to make a software modification. 

4. Effort to Repair - Average number of man hours required to 
correct an identified error. 
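
The four attributes above reduce to simple ratios. The following is a minimal sketch in Python, assuming a hypothetical per-project record; the field names and the sample figures are illustrative only and are not SEL data.

```python
from dataclasses import dataclass


@dataclass
class ProjectRecord:
    """Hypothetical per-project totals; field names are illustrative."""
    developed_loc: int        # developed lines of code
    effort_man_months: float  # total development effort
    errors_reported: int      # errors reported against the software
    change_hours: float       # man hours spent on all modifications
    changes_made: int         # number of modifications
    repair_hours: float       # man hours spent correcting errors
    repairs_made: int         # number of errors corrected


def quality_measures(p: ProjectRecord) -> dict:
    """Compute the four quality attributes as defined in the text."""
    return {
        # developed lines of code per man month
        "productivity": p.developed_loc / p.effort_man_months,
        # errors reported per 1,000 lines of code
        "reliability": p.errors_reported * 1000 / p.developed_loc,
        # average man hours per software modification
        "effort_to_change": p.change_hours / p.changes_made,
        # average man hours per corrected error
        "effort_to_repair": p.repair_hours / p.repairs_made,
    }


# Made-up sample values, chosen only to make the arithmetic easy to follow.
sample = ProjectRecord(
    developed_loc=50_000, effort_man_months=100, errors_reported=250,
    change_hours=1_200, changes_made=300, repair_hours=800, repairs_made=200,
)
measures = quality_measures(sample)
```

Note that a lower value is better for reliability, effort to change, and effort to repair as defined here, while a higher value is better for productivity.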

2.1 Experiment Description (Chart 3) 

In carrying out the study, a review was undertaken of all projects 
for which detailed project history data was available and complete. 
From the 50 completed projects, 14 were selected 
because of the quality and completeness of the relevant data and, 
more importantly, because of the general similarity in 
complexity of the problems that the software was attempting to solve. 

Fourteen projects ranging in size from 11,000 lines of code to 
136,000 lines of code were selected. These projects had 
information describing the environment under which they were 
developed and additional information such as the number and 
quality of automated tools utilized and the number of interactive 
terminals available to the programming staff. 


*Lead investigators of this work included F. McGarry and J. Valett of NASA/GSFC 
and D. Hall of NASA/HQ. 





The 14 projects selected all dealt with solving attitude 
determination and control related problems. The projects were 
completed between 1978 and 1984. 

The projects also had detailed information as to manhours, size, 
error history, and effort required to make all changes and 
corrections to the software. 

2.2 Project Variations (Chart 4) 

In attempting to characterize each of the development projects, 
a ranking scheme was used for this particular study. It was 
found that the availability of terminals ranged from a low of 
less than 1 per 8 programmers to a high of better than 1 per 2 
programmers. 

There were a total of 21 tools considered in this study that 
were applied by at least some of the projects studied. Such 
tools as documentation aids, preprocessors, test generators, and 
program optimizers were among the tools considered. 

It was also found that the level of tool use 
ranged from a low of only 1 or 2 automated tools being used to a 
high of more than 8 automated tools being used. Each tool was also 
rated on its actual usage by the particular project 
and on the assessed 
'quality' of the particular tool. Quality here was rated for 
each tool on a scale of 1 to 5 and was a subjective rating 
determined by the software manager. 

There were a total of 11 characteristics that made up the 
computer support measure. These 11 included: 

o Terminal Accessibility              o Offline Storage 

o Turnaround Time                     o Interactive Availability 

o Compiler Speed                      o Terminals/Programmers 

o System Reliability (2 measures)     o Avg. CPU Utilization 

o Direct Storage                      o Accessibility of all 
                                        resources 


2.3 Study Results (Chart 5) 

The results of this particular study were encouraging on the one 
hand and quite perplexing on the other. 

2.3.1 Tool usage results showed that as the number and quality 
of automated tools increased, there were significant increases in 
3 of the 4 quality measures used in this study: 

1. Productivity increased as tool usage increased. 

2. Maintainability (effort to change/effort to repair) 
improved as the number and quality of tools increased. 

3. Reliability did not seem to be significantly impacted in 
this one particular study. 

2.3.2 Computer Environment 

Although all of the experimenters felt that there would be 
significant increases in all quality measures as the overall 
quality of computer support increased, none of the measures 
showed a significant effect in this one particular study. It could 
not be shown that an improved computer support environment (at 
least as the SEL characterized the support environment) 
directly and favorably impacted the four quality measures used by 
the SEL. 

This particular study is still undergoing further analysis. 

2.3.3 Terminal Usage 

The most perplexing result of this study was the 
one in which the SEL attempted to assess the impact that 
an increased number of terminals would have on the four measures 
described. 

Although the experimenters expected to observe an increase in 
both productivity and software reliability as the number of 
terminals made available increased, the study found just the 
opposite. Both productivity and reliability of software 
decreased as the ratio of terminals available increased. There 
was no significant result for maintainability (effort to 
change/effort to repair). 

Numerous suggestions have been put forth in attempting to explain 
this phenomenon. Some felt that the increased terminal usage 
possibly was not properly accompanied by interactive support 
tools in the particular environment. 

Another idea was that the increased terminal availability without 
proper training for the programmers led to a less disciplined 
approach by the programmers. 





There are several other possible explanations of the results and, 
for that reason, this particular study is continuing and 
will attempt to analyze more thoroughly this data as well 
as the additional projects that have been completed in this 
environment. 

3.0 Software Testing 

A second general set of studies conducted over the 
past several years within the SEL has been directed toward gaining 
insight into approaches to testing software. Since this phase of 
the development life cycle had previously been determined to 
consume at least 30 percent of the development resources 
(Reference 5), it was deemed a critically important discipline 
to study. Two major experiments were conducted during 1984 and 
1985 in an attempt to: 

1. Determine the overall coverage of software in the 
typical testing scenario utilized in the flight dynamics 
software development. 

2. Investigate the relative merits of three standard 
testing approaches: 

o functional testing 
o structural testing 
o code reading 


3.1 Test Coverage* (Chart 6 and Reference 6) 

The first experiment on testing was designed to determine the 
extent to which typical testing techniques within the flight 
dynamics environment amply exercised the software that had been 
built. This particular environment utilizes functional testing 
during both the system test phase and the acceptance test 
phase. 

By instrumenting a major flight dynamics system and then 
executing the series of both system tests and acceptance tests, 
experimenters could first determine the coverage attained in the 
test phases. Next, the experimenters monitored the operational 
execution of this same software over a period of months to 
determine the extent to which portions of the completed software 
were utilized. Finally, the experimenters analyzed uncovered 
errors in an attempt to determine if the errors occurred in 
portions of the system that had not been exercised during the 
test phase of development. The software studied was a major 
subsystem of a mission planning tool and consisted of 68 modules 
(Fortran subroutines) with 10,000 lines of code. There were 10 
functional tests making up the acceptance test plan for the 
subsystem and, during the operational phase, the experimenters 
monitored 60 operational executions of the software. 


*The lead investigator for this work was Jim Ramsey of Univ. of MD 

3.1.1 Test Coverage Results (Chart 7) 

The managers of the flight dynamics development systems noted 
that the approach to testing had historically been quite good 
(relatively few errors found in operations) and they expected 
that the coverage found for this one experiment would be quite 
high (few modules would go unexecuted). The results of the 
experiment showed that for the 10 functional tests executed, only 
75 percent of the 68 modules were executed and less than 60 
percent of the total executable code was covered in the tests. 


Additionally, the series of operational executions showed that a 
slightly higher percentage of both number of modules and lines of 
code were executed for this series of 60 executions. 

Finally, all of the error reports were reviewed to determine in 
which portion of the system the errors had occurred. It was 
found that 8 errors had been recorded during the extended 
operational phase of the software, but none of 
the reported errors occurred in software that had not been 
executed during the acceptance test phase. 
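
The three comparisons described above (coverage under test, coverage in operation, and location of reported errors) can be expressed as set operations over module names. The sketch below is one possible formulation in Python; the function name, the module names, and the sample sets are hypothetical and do not reproduce the instrumented system or its data.

```python
def coverage_report(all_modules, tested, operational, error_modules):
    """Summarize module-level coverage and where reported errors fall.

    all_modules   -- every module in the subsystem
    tested        -- modules executed by the acceptance tests
    operational   -- modules executed during operational use
    error_modules -- modules named in operational error reports
    """
    all_modules = set(all_modules)
    tested = set(tested) & all_modules
    operational = set(operational) & all_modules
    return {
        "test_coverage": len(tested) / len(all_modules),
        "operational_coverage": len(operational) / len(all_modules),
        # modules used in operation that the tests never exercised
        "used_but_untested": sorted(operational - tested),
        # reported errors located in code the tests never exercised
        "errors_in_untested": sorted(set(error_modules) - tested),
    }


# Eight hypothetical modules stand in for the subsystem under study.
modules = [f"MOD{i:02d}" for i in range(1, 9)]
report = coverage_report(
    all_modules=modules,
    tested=modules[:6],        # tests reach 6 of 8 modules
    operational=modules[:7],   # operations reach 7 of 8 modules
    error_modules=["MOD02", "MOD05"],
)
```

An empty `errors_in_untested` list corresponds to the outcome reported in the text: no recorded error fell in code that the acceptance tests had not executed.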

This initial study seemed to indicate that the functional testing 
approach was leading to the correct portions of the system 
being executed and that it was also very representative of the 
operational usage of the software. 

The results of this study indicated that further investigations 
into the various approaches to testing may be worthwhile to 
determine just which approaches were most effective in uncovering 
errors in the software itself. 

3.2 Software Testing Techniques (Chart 8 and Reference 7) 

Another study was conducted where three programs were seeded with 
a number of faults and 32 professional programmers from NASA/GSFC 
and from Computer Sciences Corporation (CSC) participated in an 
experiment to determine which techniques were effective in 
uncovering these faults. 

The three testing approaches included: 

o Functional Testing 
o Structural Testing 
o Code Reading 


*The lead investigator for this study was Rick Selby of Univ. of MD 

All programmers participated in applying each of the three 
techniques. 

When performing functional tests, the programmers were required 
to use the functional requirements along with test results to 
isolate faults - they were not to look at the source code itself 
until after testing was completed. 

Those programmers performing structural testing used the source 
code and test results but did not use the functional 
requirements. 

Code reading was carried out with no executions of the software. 
Those performing code reading reviewed the requirements and also 
looked at the source code. 

3.2.1 Testing Technique Results (Charts 9 and 10) 

The results of this experiment indicated that code reading is the 
most effective of the three testing techniques studied. This 
technique uncovered an average of 61 percent of all seeded faults 
while functional testing uncovered 51 percent and structural 
testing uncovered 38 percent. 


Before the test, most of the managers in the SEL felt that code 
reading would prove to be a very effective testing technique, 
although they also felt that it would probably be the most costly 
in manhours to apply. The results of the experiment, however, indicated 
that code reading was also the most cost effective technique (3.3 
faults per manhour vs 1.8 faults per manhour for structural and 
for functional testing). It was also noteworthy that, before the 
experiment, less than 1 out of 4 persons participating in the 
experiment predicted that code reading would be the most 
effective approach. 
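
The two figures of merit used above, the share of seeded faults detected and the number of faults found per manhour, can be computed as follows. This is an illustrative Python sketch: the function name and the session figures are invented for the example and are not the experiment's data.

```python
def technique_summary(faults_found, faults_seeded, hours_spent):
    """Return detection rate and cost effectiveness for one technique."""
    return {
        # fraction of the seeded faults that were uncovered
        "detection_rate": faults_found / faults_seeded,
        # faults uncovered per manhour of effort
        "faults_per_hour": faults_found / hours_spent,
    }


# Hypothetical totals: (faults found, faults seeded, manhours spent).
sessions = {
    "code reading": (6, 10, 2.0),
    "functional testing": (5, 10, 2.5),
    "structural testing": (4, 10, 2.0),
}

summary = {
    name: technique_summary(found, seeded, hours)
    for name, (found, seeded, hours) in sessions.items()
}

# The most cost effective technique is the one with the highest
# faults-per-manhour figure.
most_cost_effective = max(summary, key=lambda n: summary[n]["faults_per_hour"])
```

Keeping the two measures separate matters because a technique can detect the largest share of faults and still be the most expensive per fault; in the experiment reported here, code reading led on both.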

An additional observation that was made after the testing results 
were compiled was that there seemed to be a difference in the 
relative effectiveness of each of the testing approaches as the 
size of the software being tested increased. For the smaller 
program, code reading was by far the most effective technique, 
but for the larger program, functional testing seemed to be quite 
effective. This observation may indicate that there should be a 
size limit on how much code is utilized in a code reading 
exercise. Further tests are planned for these studies. 





4.0 Software Measures 


Over the past 6 to 8 years, the SEL has defined, studied, and 
evaluated numerous measures applicable to software development 
and management (References 8, 9, 10). Most of these measures 
have focused on one phase of the software life cycle - the code/ 
unit test phase. In an attempt to define and apply measures in 
earlier phases of the life cycle, the SEL has been reviewing 
several approaches to qualifying or measuring aspects of the 
software during the specifications phase and during the design 
phase. Work on the specification phase was reported at the Ninth 
Software Engineering Workshop and may be found in References 11 
and 12. One additional piece of work that has been conducted for 
the design phase will be discussed here. 

4.1 Software Design Measures* (Charts 11 and 12 and References 
13, 14) 

In an attempt to qualify software designs, a study was conducted 
to determine if module strength may be utilized as a guideline 
for software modularization. Although the definitions of 
strength may be well understood, the parameter may not be easy to 
determine based solely on a structure chart or data flow diagram 
which may be produced during the design phase of software 
development. 

For the purposes of this study, strength is defined as the 
'singleness of purpose' that a software module inherently 
contains. Singleness of purpose is a subjective parameter 
assigned at design time by the developer/manager. From a list of 
potential functionality that a component may have (e.g., computational, 
control, data processing, etc.), the programmer determines 
which functions that module contains. High strength is 
attributed to those components which have but a single function 
to perform, medium strength to those with 2 functions, and low 
strength to those with three or more functions to perform. 
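
The strength rule stated above maps the number of distinct functions assigned to a module onto three levels. A minimal Python sketch of that rule follows; the function name and the sample function labels are illustrative, and the subjective step of deciding which functions a module contains is left to the caller.

```python
def classify_strength(functions):
    """Classify module strength from the functions assigned at design time.

    One distinct function gives high strength, two give medium,
    and three or more give low, as defined in the text.
    """
    distinct = len(set(functions))
    if distinct == 0:
        raise ValueError("a module must be assigned at least one function")
    if distinct == 1:
        return "high"
    if distinct == 2:
        return "medium"
    return "low"


# Illustrative labels drawn from the examples in the text.
single_purpose = classify_strength(["computational"])
two_purposes = classify_strength(["control", "data processing"])
many_purposes = classify_strength(
    ["computational", "control", "data processing"]
)
```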

The study examined 450 Fortran modules (from 4 systems) which 
were built by approximately 20 different developers. 

Typical SEL data, which includes detailed cost and error data, 
was also available for all of the modules. The 450 
modules used for this study had a fairly even distribution in 
size as well as in design strength. Small modules (104 of the 
450) were those with up to 31 executable statements, medium (148 
of 450) were those with up to 64 executable statements and there 
were 151 large modules which had more than 64 executable 
statements. 


*The lead investigators for this study were D. Card and G. Page of CSC and 
F. McGarry of NASA/GSFC 





The objective of the study was to determine if strength of 
modules as determined at design time was related to the cost and 
reliability of the completed product. 

4.2 Results of the Study on Strength (Charts 13, 14, 15) 

The results of the study in the SEL indicated that module 
strength is indeed a reasonable criterion for defining software 
modularization. When examining the reliability of the 450 
modules, it was found that 50 percent of the high strength 
modules had zero defects, while 36 percent of the medium strength 
modules and only 18 percent of the low strength modules 
had zero defects. Similar trends were found for 
the modules of medium error proneness (up to 3 errors per 1000 
lines of code) and for modules having a high error rate (over 3 
errors per 1000 lines of code). 
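
The tabulation described above can be sketched as a cross-classification of modules by strength and by error class. The Python below assumes each module is available as a hypothetical (strength, error count, lines of code) tuple; the helper names and the sample modules are invented for illustration and are not the 450 modules studied.

```python
from collections import Counter, defaultdict


def error_class(errors, loc):
    """Classify a module as zero, medium, or high error proneness.

    Medium is up to 3 errors per 1000 lines of code; high is over 3.
    """
    if errors == 0:
        return "zero"
    rate = errors * 1000 / loc  # errors per 1000 lines of code
    return "medium" if rate <= 3 else "high"


def tabulate(modules):
    """Count modules in each (strength, error class) cell."""
    table = defaultdict(Counter)
    for strength, errors, loc in modules:
        table[strength][error_class(errors, loc)] += 1
    return table


def zero_defect_share(table, strength):
    """Fraction of modules of the given strength with zero defects."""
    counts = table[strength]
    return counts["zero"] / sum(counts.values())


# Made-up modules: (strength, errors, lines of code).
modules = [
    ("high", 0, 200), ("high", 1, 500),
    ("low", 2, 400), ("low", 0, 300), ("low", 3, 2000), ("low", 4, 1000),
]
table = tabulate(modules)
```

Reading the table by row (strength) gives the zero-defect percentages quoted above; reading it by column (error class) gives the distribution of 'buggy' modules across strength levels.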

The distribution of the 'buggy' modules (over 3 errors per 1000 
lines of code) was shown to tend more toward low strength as 
opposed to high strength. Forty-four percent of the buggy 
modules had low strength while only 20 percent of the buggy 
modules were found to have high strength. 

Several additional observations were made while conducting this 
particular study. When the characteristics of the individual 
programmers were reviewed, it was found that those programmers 
who produced high quality software (low error rate and high 
productivity) tended to design modules of high strength but they 
also did not show a preference for writing modules of any 
specific size. Good programmers generated modules of size that 
seemed to best suit their design and they did not artificially 
constrain themselves to writing small modules. 

5.0 General Trends and Observations 

Over the past several years, the SEL has conducted numerous 
studies and experiments in an attempt to better understand the 
impact that various software techniques may have on producing 
improved software. In addition to the specific studies conducted, 
such as the ones briefly discussed in Sections 2, 3, and 4, the 
SEL has observed general trends in the development and 
measurement of software. The observations include such points as 
trends in software reuse, trends in utilization of improved 
software development technology, and the overall impact of 
improved development techniques on the cost and reliability of 
software over a long period of observation time. Some of these 
general observations are summarized here. 





5.1 Trends in Computer Use and Technology Application (Charts 
16, 17) 

From data that has been collected on nearly 60 projects over the 
past 9 years, one trend that has been noted is the tendency to 
make heavier and heavier usage of available computer support. In 
1977 and 1978, computer use averaged approximately 100 runs per 
1000 lines of developed source code while in 1982 and 1983 the 
average use increased to nearly 250 runs per 1000 lines of 
source. This trend continues to increase within the flight 
dynamics environment being studied. 

Simultaneously, it was noted that the use of 
structured development practices, improved management approaches, 
and overall higher quality software engineering has continually 
increased. Each project has been rated on its application of 
over 200 software techniques (see Reference 15) in an attempt to 
quantify the overall level of development and management technology 
utilized for a project. The aggregate of the total set of 
techniques applied results in a rating termed the Software Technology 
Index. From an average index of less than 100 in 1976 to 
1978, the overall rating has 
increased to an average of over 140 in the 1980's. This seems to 
point to improved training, better discipline, improved access to 
tools and possibly better informed management practices. 

Although both parameters (computer use and software technology 
index) seemed to generally increase over the past 7 or 8 years, 
there is no observed correlation between these two factors. 

5.2 Trends in Software Reuse (Chart 18) 

Another general observation that was made from the detailed 
development data collected by the SEL was that the reuse of 
software has shown a general trend of increase. Typical software 
systems in the years 1977 to 1979 averaged about 15 or 20 percent 
reused code while in the 1982 to 1984 timeframe the average reuse 
has increased to 30 to 35 percent. 

Although this reuse is certainly tending in the right direction, 
the SEL has not conducted detailed studies to determine what the 
driving factors are in improving the percentage of reuse. The 
trends are probably indicative of improvements in design 
technique as well as numerous other factors, but studies have 
just recently been initiated in the SEL to determine how the 
trend can be improved at an even faster pace. 

It has also been observed in the SEL data that there does not 
seem to be a direct relationship between projects that are rated 
as having a high software technology index and having a high rate 
of software reuse. But this may not be a surprise, since one 
would expect that high technology usage would lead to follow-on 
systems being able to pick up or reuse software produced by the 
projects using disciplined approaches for development and 
management. 

5.3 Impact of Development Technologies (Chart 19) 

Probably the most basic goal of the SEL is to determine 
the impact that specified software development/management 
techniques have on the cost and reliability of software. With 
nearly 60 projects having been closely monitored over the past 8 
or 9 years, the SEL attempted to look at general trends in the 
reliability and cost of these projects as measured against the 
software technology index computed for each of these projects. 
The 200 parameters factored into this index represent everything 
from structured techniques to disciplined management approaches 
to configuration control procedures. It is one attempt to 
characterize each of the projects with a single value. 

This technology index correlates very well (r = .82) with 
reliability of software in the SEL. Those projects with a higher 
rating of good development practices were the projects with the 
lower fault rates of the product. 
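
The r value quoted above is a correlation coefficient. The paper does not state which coefficient was used; the sketch below assumes the ordinary Pearson product-moment coefficient and applies it to invented index and fault-rate values, so both the choice of statistic and the data are assumptions made for illustration.

```python
import math


def pearson_r(xs, ys):
    """Pearson product-moment correlation of two equal-length sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)


# Hypothetical projects: technology index and faults per 1000 lines.
technology_index = [90, 100, 120, 140, 150]
fault_rate = [9.0, 8.0, 6.5, 4.0, 3.5]

r = pearson_r(technology_index, fault_rate)
```

With fault rate as the second variable, a favorable relationship shows up as a negative r; expressing reliability as a score where higher is better would flip the sign without changing the strength of the relationship.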

Unfortunately, the impact of this technology index on 
productivity is quite unclear. The first general observation 
that has been made is that there is not a clear favorable impact 
on development cost (cost per line of code) for projects with 
higher values of this technology index. Studies are continuing 
in an attempt to more objectively compute this technology rating 
so that a more conclusive statement can be made. Some 
researchers also have suggested that it should not be unexpected 
that the specific development cost may not decrease; since 
the reliability has improved and the overall software structure 
has improved, the maintenance activity, not the development 
activity, will be the beneficiary of the overall cost savings. 

5.4 Can Software Technology be Measured? (Chart 20 and Reference 
3) 

Another major question that software engineers address is whether 
or not software technology can be measured at all. By utilizing 
reliability as one major aspect of software quality, the SEL 
attempted to determine to what extent software development/ 
management practices could be measured. 





There are three levels of development practices which the SEL has 
hoped and attempted to measure. First, there are individual 
specific techniques such as the use of structured code or chief 
programmer team or the use of PDL in design, etc. 

Second, there is the usage of a software methodology which is a 
combination of several methods into a single disciplined 
approach. This could be the set of methods known as structured 
techniques which reflect the use of 6 or 8 individual practices 
such as top down development, structured code, code reading and 
usage of Unit Development Folders (UDF). 

Finally, the attempt has been made to measure the impact of the 
total technology index which encompasses all disciplined 
management/development practices. This signifies the level to 
which the project has attempted to apply recommended software 
development techniques. 

The results of this study indicated: 

1. An individual technique cannot be effectively measured in 
a production environment such as the one in which the SEL is 
conducting studies (r = .37 is a typical value found in 
correlating PDL usage and reliability). 

2. Disciplined methodologies (combining techniques into a 
single disciplined approach) can be measured (r = .65 for one 
particular study), and the approach called Modern Programming 
Practices (6 techniques) has a significant, measurable, favorable 
impact on software reliability. 

3. Total Software Technology can be measured (r = .82 for 
this one study) and higher levels of applied technology have a 
marked favorable impact on the reliability of software. 

The trends and observations noted here are based on approximately 
8 years of data collection and experimentation within the SEL. 
Approximately 55 projects have been studied and the research is 
continuing and will continue in the future. 

Many of the results are inconclusive, but with each experience and 
study, greater insight is provided into the overall 
characteristics of the software development process. 





REFERENCES 


1. Software Engineering Laboratory, SEL 81-104, "The Software 
Engineering Laboratory," D. N. Card, F. E. McGarry, G. Page, et 
al., February 1982. 

2. SEL 81-101, "Guide to Data Collection," V. E. Church, D. N. 
Card, F. E. McGarry, et al., August 1982. 

3. SEL, 86-002, MfiflSlLr.Jng Slid. EyalUit-t.1 fl£ *£££ l££.h.Qglgg.y, 

D. N. Card, F. E. McGarry, J. Valett, to be published 

4. McGarry, F.; Valett, J.; and Hall, D., "Measuring Impact 
of Computer Resource Quality on the Software Development Process 
and Product," Proceedings of the Hawaiian International 
Conference on Systems Sciences, January 1985 

5. McGarry, F., "What Have We Learned in 6 Years," Proceedings 
of the Seventh Annual Software Engineering Workshop, December 
1982 


6. Ramsey, J., and V. R. Basili, "Analyzing the Test Process 
Using Structural Coverage," Proceedings of the Eighth 
International Conference on Software Engineering, August 1985 

7. SEL 85-001, "Comparison of Software Verification Techniques," 
D. Card, R. Selby, F. McGarry, et al., April 1985 

8. SEL 82-004, "Collected Software Engineering Papers: Volume I," 
July 1982 

9. SEL 83-003, "Collected Software Engineering Papers: Volume II," 
November 1983 

10. SEL 85-003, "Collected Software Engineering Papers: Volume 
III," November 1985 

11. SEL 84-003, "Investigation of Specification Measures for the 
Software Engineering Laboratory," W. Agresti, V. Church, 
F. McGarry, December 1984 

12. Agresti, W., "An Approach to Developing Specification 
Measures," Proceedings from the Ninth Annual Software 
Engineering Workshop, November 1984 

13. Card, D.; Page, G.; McGarry, F., "Criteria for Software 
Modularization," Proceedings of the Eighth International 
Conference on Software Engineering, August 1985 





14. Agresti, W.; Card, D.j Church, V.j '.Sininn RfififiJii fin 
■Sfififi-i f i c.aiifin anxi Gfinipn Mfiixififi S±iL£Lifi£.Lt CSC, December 1985 

15. SEL 82-001, "Evaluation of Management Measures of Software 
Development," D. Card, G. Page, F. McGarry, September 1982 





THE VIEWGRAPH MATERIALS 


for the 

F. McGARRY PRESENTATION FOLLOW 





STUDIES AND EXPERIMENTS 



CL 

O 



(/) 



CD 


in 

oo 

o> 


E 

0) 

o 

0) 

O 


o 

0 

c 


. O) 
C 

^ LU 

0 0) 
-Q 


0 

5 

o 

CO 

75 

c 

c 

< 



o 

CO 

m 

in 

o 

< 

co 

oo 


F. McGarry 
NASA/GSFC 
17 of 37 


CHART 0 




F. McGany 
NASA/GSFC 
18 of 37 






MEASURING THE EFFECTS OF ENVIRONMENT 
ON SOFTWARE DEVELOPMENT 



CM 

I— 

cc 

c 

m 

o 


F. McGarry 
NASA/GSFC 
19 of 37 



EXPERIMENT 


0 

CL 

>* 

h- 

"cd 

L_ 

0 

C 

0 

O 


V) 


0 ~ 

I s 

w| 

o2 

2 l 


•® o> 

o c 

£ >* 

* ^ 


o 

o 


CD 

CO 


o 

o 


E 

o 


CO 

0 

'u' 

cd 


CD 

N 


o LU c/) 


o 

CD 

O 

al 


O) 

c 

> 

b 

•a 

CO 

k. 

CD 

0) C 

if 

c 
o 


2 

cd 

0. 


CO 


> 

O 

a.MM 


UL <0 


O 

>* 


cd 


c 

o . 
■o (5 

<]) CJ 
cd *S 
“ c 

CO O 

o -5 

.© s 

2 =5 

cl = 


■O 

c 

cd 

co 

O) 

c 

ca 

cc 

c 

0 

0 

£ 

•*—» 

0 

CQ 

co 

c 

o 

mmmm 

-*-* 

0 

0 

o 

o 


0 

0 


■D 
0 
C 

E 
0 
x - 
LU 


3 

0 

0 

0 


OO 


a: 

C 

m 

cj 


F. McGany 
NASA/GSFC 
20 of 37 


86A0553.09 



ENVIRONMENT VARIATIONS 



00 


O) O) 

m mmmrn m wb 

X X 

i. v_ 

CD 0 

> > 


0 ® 
CO "O ~ 


o 

CO 


CM CO 
V to 


o 

o _ 

0 CD 


V 5 


CO 


O) 

c 

£ £ 
o o 

>% 

0 

Q 

*o 

c 

o 

sz 

o 

0 00 
DC 

™ >> >, 
u v_ 

o 

0 

CO 

*-» 

0 

CO 

£ 

O 

-1 

0 0 
> > 

A 

o 

CM 

A 

< 




0 

h- 


0 


O 

o 

f- 


0 

n 


0 £ 

D) '■= 
0 0 
0 3 

3 O 


E o o 

3 0 0 
Z h- h- 


JO 

O 

O 

I- 


0 

> 

0 o 

E ® 2 

iZ E 0 

U h C 

c 0 
0 0 
c > 

O p- 

o. o 

0 .3 
3 0 0 
I- DC CO 


O 

0 


0 C 

3 ® 

E i 

1 2 

o > 

c 

LU 


*3“ 

I — 
DC 


<C 

G 


F. MeGarry 
NASA/GSFC 
21 of 37 


86A0553.10 



RESULTS 



■O 

c 

CO 

® © 
■?E 

5 «= 

^ o 
— o 

k- k- 

gw 

2 © 

> 5 
£ ° 

*S w 
= © 
<D ~ 

° » 

• z 

c ►" 

.2 ' 

51 

© Q. 

t © 

OOC 

O o 

© f- 

> ♦- 

II 

fB 

II 

+ 


© 

0) 

ka 

u 

O 

O 

© 

> 

*5 

OD 

© 


C 

o 

© 

© 

fc_ 

k. 

o 

O 

o 

z 

li 

o 


LO 


cc 

<c 

□c 

o 


F. McGarry 
NASA/GSFC 
22 of 37 


86A0553.07 








TEST COVERAGE 







TEST COVERAGE RESULTS 




CHART 7 



STUDIES OF SOFTWARE 
TESTING TECHNIQUES 






[Chart: Code Reading, Functional Testing, and Structural Testing compared in two panels; plotted values not recoverable]


Code Reading Proved To Be the Best Technique in Terms of the Total Number 
of Faults Detected and the Faults Detected Per Hour of Effort 
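
The two measures named in this finding (total faults detected, and faults detected per hour of effort) can be sketched as below. This is only an illustrative sketch: the `Session` record, the `summarize` function, and every number in it are hypothetical placeholders, not SEL data or the SEL's actual analysis procedure.

```python
from dataclasses import dataclass


@dataclass
class Session:
    """One subject applying one technique to one program (hypothetical record)."""
    technique: str
    faults_found: int
    hours: float


def summarize(sessions):
    """Return total faults and faults per hour of effort for each technique."""
    totals = {}
    for s in sessions:
        faults, hours = totals.get(s.technique, (0, 0.0))
        totals[s.technique] = (faults + s.faults_found, hours + s.hours)
    return {
        technique: {
            "faults": faults,
            # Guard against a technique with no recorded effort.
            "faults_per_hour": faults / hours if hours > 0 else 0.0,
        }
        for technique, (faults, hours) in totals.items()
    }


# Arbitrary placeholder values for illustration only; NOT measurements from the study.
sessions = [
    Session("code reading", 5, 2.0),
    Session("code reading", 3, 2.0),
    Session("functional testing", 4, 2.0),
    Session("structural testing", 3, 2.0),
]

summary = summarize(sessions)
for technique, measures in summary.items():
    print(technique, measures["faults"], measures["faults_per_hour"])
```

Reporting both measures separates raw detection effectiveness from cost effectiveness, since a technique could find more faults simply by consuming more effort.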

Prior To the Experiment Only 23% of the Subjects Believed Code Reading To 




TESTING TECHNIQUES VS. 
PROGRAM SIZE 

Percent of Faults Found 







SOFTWARE DESIGN MEASURES 

Objective 





SOFTWARE DESIGN MEASURES 






CHART 12 



FAULT RATE FOR CLASSES 
OF MODULE STRENGTH 






DESIGN CHARACTERISTICS 







DESIGN MEASURES SUMMARY 







COMPUTER USE AND TECHNOLOGY 

TIME TRENDS 






EFFECT OF TECHNOLOGY ON 
COMPUTER USE 







TRENDS IN SOFTWARE REUSE 
(BASED ON 15 PROJECTS OF 
SIMILAR CHARACTERISTICS) 









EFFECTS OF DEVELOPMENT 
TECHNOLOGIES 







EFFECT OF TECHNOLOGY USE ON 
SOFTWARE RELIABILITY 






